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POPULATION COUNTS USING A COMPUTER DATA BANK 
R.E. Davies, M.A. 



1. Introduction 

This report describes a computer system to count 
population within user specified areas within the UK. The 
setting up of a computer-based file of population within a 
set of half-kilometre grid squares covering the UK was 
described in a previous report. The data were based upon 
1966 census figures obtained from the census office which, 
in due course, it is hoped to update using the 1971 census 
data. 

The prime objective of the system was to meet two 
requirements relating to television and radio network 
coverage: to determine the population served by existing 
networks, and to estimate the population that would be 
served by a proposed new station. Surveys, site tests and 
calculations can supply contour maps showing the geo- 
graphical areas served by proposed and existing transmitters. 
The computer programs are intended to use these data, 
after being suitably digitised, to determine population 
counts within the areas. Several kinds of run were en- 
visaged. Firstly, a count might be required for a single 
small area, for example, in a u.h.f. television or v.h.f. local 
radio planning job. Secondly, a count might be required 
for a single large area (a substantial proportion of the 
UK), for example, to determine the population served 
by an m.f. radio station. Thirdly, a count might be required 
for a whole chain of stations intended to cover as much 
of the UK as possible, for example, to provide reliable 
information regarding the coverage of a national television 
network. This last case is currently the most important 
as applied to the u.h.f. network. Here figures are 
required for the gross coverage of each station, the net 
coverage of each new station (i.e. the population served 
by the new station which was not previously served by 
any station) and the cumulative coverage of the whole 
network. This involves a number of separate counts on 
the whole data which relates to at present about 100 
stations whose service areas are described by about 4,500 
separate contours. Since additional stations are contin- 
ually being added to the network it is necessary to hold 
a file of current contour information which can be added 
to, and from which counts for a selection of stations 
can be made on demand. 

It was desired to meet these requirements in such a 
way that each operation would be cost-efficient in its own 
right, and in order to achieve this a central counting 
program was written to satisfy the smaller-run requirements, 
allocating as much core space as necessary to the directories 
of the various files both temporary and permanent, but 
keeping the main data for the fifes on the appropriate 
peripheral storage medium. Random access storage was 
used for the population data file, the contour files and 
compacted coverage area files in grid form. Magnetic 
tape storage was used for the permanent u.h.f. coverage 
files. The central program was written in modular form 
so that modifications to satisfy the large u.h.f. counting 
runs could be effected as easily as possible. This last 



requirement was satisfied by two types of run, an update 
run which added new data to the coverage tape file to 

produce a new generation, and a count run which counted 
selected station coverage data from the coverage tape file 
in gross, net.and cumulative senses. 

It was recognised at the outset that data entry to 

programs was going to involve difficulties in validation 
since previous experience of trace digitising had indicated 
that both operator fatigue and paper tape punch unreliablity 
contributed to an error rate which was not by any means 
insigificant. It was essential to devise and implement 
operator procedures and data validation routines which 
would be as near foolproof as possible. 

The trace digitiser consists of a board on which to 
mount the map showing contours to be digitised, a cursor 
with which the operator follows the contour while controll- 
ing the paper tape punch with a button on the cursor, and a 
numeric keyboard for entering identification data. To 
assist the operator in remembering the order of punching 
and to allow better program checking of the data a 'menu 
card' was devised to be attached permanently on the board 
outside the normal map mounting area. The card was 
divided into labelled areas and the operator was instructed 
to precede each manually-punched identification and each 
new contour with an automatic punch obtained by centering 
the cursor on the appropriate area of the menu card. It 
was then possible to check the data sequence and data 
formats very thoroughly in the data-validation section of 
the program. 

To further assist the checking a short validation 
program was used which selected lines from the paper tape 
input file and listed them, allowing a manual check of the 
file to be carried out fairly easily prior to any runs. It 
was necessary to select only a proportion of the lines from 
this file because in many cases a complete listing would 
require some forty or fifty pages. 

The program suite is housed at the UCC bureau Univac 
1108 computer and in order to be cost-efficient maximum 

use was made of their hardware configuration and avail- 
able software. For this reason it would be difficult to 
move the suite to another system, because apart from 
hardware and systems incompatibility, the Fortran V 
language is not yet standardised so that some features of 
UCC Fortran' will not be available in the same form else- 
where. However, access to the programs at UCC is possible 
using only a dial-up teletype terminal assuming that the 
user has the necessary trace-digitising equipment. 



2. General principles of counting methods 

The operation of counting population is, in principle, 
a straightforward process. The area is divided into half- 
kilometre grid squares and for each square the approp- 
riate population figure from the population data file is 
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added into an accumulating total. To achieve cost-effic- 
iency the actual processing has to be considerably more 
complicated. The population data file is arranged in 
ten kilometre square blocks, each block containing four 
hundred numbers. The contours defining areas to be 
counted are first processed to extract a set of ten-kilometre 
grid squares which intersect the contour or are contained 
within it and from each square a data block is set up to 
contain 400 bits which define whether each half-kilometre 
square is contained within the contour or not. These 
blocks of data are accumulated in a random-access file 
stored on a fast -drum backing store. At the same time a 
core- resident directory to this file is set up to contain 
the number of the ten-kilometre square and the backing 
store address of the data block. The key to an efficient 
counting procedure is that by sorting this directory into 
the same ten-kilometre square order as the main pop- 
ulation data, the latter can be read sequentially with only 
the required information selected. Each block is accessed 
only once. The block length chosen was 7168 words 
which, after compacting of the data, contained population 
data for, on average, 150 ten-kilometre squares. The 
backing drum file of area data is read randomly in the 
new (sorted) order, however, since this medium is some 
twenty times faster than the permanent random access 
medium a degree of balance in efficiency is achieved in the 
program between reading the two media. 

From the user's point of view it is also very desirable 
to be able to carry out many counts in one run. To 

achieve this, the areas to be counted are identified by a 
number which is carried through to the backing store file of 
area data, so that when the populations are accessed the 
figures can be directed to a number of different accumu- 
lating totals. To allow accumulation of net as well as gross 
population figures for a set of service areas for a single 
network, totals are formed for all intersection regions 




Fig. 1 ■ Counts for intersecting regions 



between any number of differently identified areas. Fig. 
1 illustrates a typical case. Three areas have been specified 
but to take amount of overlapping seven totals are formed 
namely R1, R2, R3, R1 & R3, R3 & R2, R2 & R1 and 
R1 & R2 & R3. Gross counts would, typically for R1 f 
be the sum of the figures for R1, R1 & R2, R1 & R3 and 
R1 & R2 & R3 whereas the net count for R1 would be 
the R1 total only. 

A further improvement inefficiency was obtained for 
the case when very large areas were required to be counted. 
These might wholly contain a large number of ten-kilometre 
grid squares. To avoid having to sum all 400 numbers 
for these squares a total population figure for each 
ten-kilometre square is held in the population data file's 
directory which also contains ten-kilometre square number 
and file address of the data and is brought to core for each 
run. A special bit in the area-defining data block is 
reserved to indicate whether the whole square is included. 
If so the figure is read from the directory and no access 
to the main data file is necessary. 

In order to allow the user a full range of possibilities 
in defining areas by trace digitising, a single identification 
can be used for as many contours as he wishes. These will 
all be considered as part of the same area. In order to 
specify 'holes' in regions he is also enabled to attach a sign 
(+ or —) to each contour so that a '— ' sign will cancel a 
region wholly within another contour. The logic for count- 
ing is that a half-kilometre square will be included only if 
one or more '+'ve contours contain its centre and no 
'-'ve contours contain its centre. If any half-kilometre 
square centre was contained in a '— 've contour but not 
in a '+'ve contour it was considered desirable to print a 
warning message to the user. 

3. System description 

3.1 Available facilities 

The programs are normally run from a short FASBAC* 
run file to which is appended the input data file which 
consists of a few manually entered definition card-images 
followed optionally by the file of trace data read from 
trace digitiser paper tape. The program file is resident on 
FASTRAND* random access store and is called and 
executed using UCC system utilities. The programs 
access and maniuplate four major data files according 
to the run requested. Input data may be checked and 
set up in a new or existing CONTOURS file, the 
CONTOURS file may be processed to form an AREA 
file, an AREA file may be used to produce a pop- 
ulation count, data being extracted from the POPULA- 
TION file, AREA file data may be merged with a 
COVERAGE file to form a new updated COVERAGE 
file or a COVERAGE file may be set up as an AREA file 
and counted. Each of these possibilities is catered for 
by calling one or more program modules which are strung 
together by short main programs or selected according to 
information from the definition input cards. 

* Readers unfamiliar with UCC systems should refer to Appendix 1 
or UCC manuals. 
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3.2 Data files 

3.2.1 CONTOURS file 

This is a user assigned FASTRAND file containing 
a data base of contour points in sequence for a number of 
contours and a directory with one entry for each contour 
containing data defining the origin NGR and map scale 
for the contour, the identification number of the area to 
which it belongs, its sense for counting (+ve or -ve), its 
number of points and the address in the data base of its 
first point. 

The user assigns the name, surname, password, 
protection key and number of blocks during a run which 
may either enter new data, or add to existing data and 
optionally initiate a population count for its contents. 
It may be either temporary for this run only or permanent 
for the addition of future data. 

3.2.2 AREA file 

This is a program-computed temporary file held on 
fast drum backing store. It contains the same information 
as a contours file but re-arranged into 14-word blocks each 
of which specifies all half-kilometre squares contained 
within a particular ten-kilometre square. A block contains 
the ten-kilometre square number, the identification number 
and one bit per half-kilometre square to determine whether 
it is included in the region. 



remove the bulk of contour data provided it conforms 
to the correct format and list only identifications, 
'menu' points from the trace-digitiser menu card 
and erroneous format points resulting from paper 
tape punch errors. A line number is also printed 
so that the user can easily locate incorrect lines in 
the FASBAC trace data file and correct them. 



3.3.2 

MAIN is the central data entry and counting program. 
Data cards are inserted at the beginning of the 
FASBAC trace data file to identify the SURNAME, 
NAME, optional PASSWORD, optional KEY, optional 
I ND ICATO R which controls whether or not successful 
data entry is to be followed by a population count. 
If the PASSWORD is omitted it is assumed that 
only temporary CONTOURS file is required and 
it is deleted at the end of the run {i.e. not permanently 
stored under the user's account). If a PASSWORD 
is included the program first searches for a FAST- 
RAND file with the given SURNAME and NAME. 
If one is found the new data is appended to this file; 
otherwise one is acquired and the new data entered 
into it. If an error is detected in the input trace 
data the checking continues but a requested pop- 
ulation count is aborted and the new information is 
not written to the FASTRAND contours file. 



A directory is maintained in core which links the 
ten-kilometre square number to address of the block in the 
data base. 

3.2.3 COVERAGE file 

This is a user's permanent magnetic tape file of 
regions using the same data structure as an AREA file. 
It is sorted in major order of identifications (the order 
being specified by the user) and in minor order of ten- 
kilometre square numbers. 

3.2.4 POPULATION file 

This is the population data base resident on FAST- 
RAND. Its principle component consists of variable- 
length blocks, each of which contains the populations of 
the 400 half-kilometre squares making up a 10-kilometre 
square. It also contains a directory, which is brought to 
core during a count run and which links each ten-kilometre 
square number with the address of the data. The data is 
highly compacted using an efficient variable length code for 
integers, which was described in reference 1. It is trans- 
ferred to core in 7168 word blocks. 



3.3.3 

MAINP outputs the data from a CONTOURS file 
in the form of a plot for checking or other purposes. 
The plot consists of re-drawn contours together with 
scales, grids and NGR's. The appropriate cards to 
call up standard plotting software must be present 
in the run stream. 

3.3.4 

WRITE lists outline information for contours from 
a CONTOURS file and optionally lists the contour 
points for each contour. 

3.3.5 

DELETE deletes a user's permanent CONTOURS file. 

3.3.6 

MAINR counts population within circular areas. This 
facility avoids the use of trace digitising but is clearly 
limited in scope. 



3-3 Main programs 

3.3.1 

CHKFIL is a line by line format checking program 
which selectively lists the input data lines. according 
to a set of rules. The purpose of this program is to 



3.3.7 

MAINA is a program which opens or updates a 
COVERAGE file. The input is from a CONTOURS 
file, the information from which is merged with file 
information and output to a new generation COVER- 
AGE file. 
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Fig. 2 - System flow diagram 
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3.3.8 



MAINB is a program which counts population for a 
selection of regions from a COVERAGE file. 

3.4 System flow diagram 

See Fig. 2. 

4. Operating procedures and program checking 

4.1 Preparation of contour maps 

Contours may be drawn on maps or map overlays to 
any scale which allows the traced regions to fit the trace 
digitiser board size. The trace should show a 10-kilometre 
grid origin point near its bottom left-hand corner and X- 
and Y- areas on which are marked a point on each area at 
a chosen distance in kilometres on the map from the 
origin. These points are used to indicate the scaling of 
the contours and should be as far from the origin as 
possible whilst remaining within the trace area. Each 
contour should be closed and to assist tracing accuracy if 
many contours are present they should be marked with a 
number, sign identification number and first tracing point. 
Fig. 3 is an example to indicate a suitable form of overlay 
for trace digitisinq purposes. 



Or 




2,f,105 




5.+.107 



6.+/M2 







P 



Fig. 3 - Map overlay for trace digitising 

Each contour is numbered and its sign, area identification number 
and start/finish point are marked 

4.2 Trace digitising procedure 

A standard 'menu' card and the map or overlay are 
first set up on the trace digitising board as shown in Fig. 4 
(which also illustrates the 'menu' card}. A complete 
trace for counting may include any number of paper tapes 



menu 
reference 2 

cancel 
new axes 

new menu 

end neg- 

contour 

end pos- 

contour 

begin 

contour 

idenl 



N G R 



menu 
reference 1 



□ 




map overlay 



menu Cfird 



Fig. 4 - Placement of menu card and overlay on trace 
digitiser board 

and new maps may be loaded as desired. Contours too large 
to fit on one map may be divided into two or more parts 
with straight line divisions. New tapes begin with a re- 
punch of the menu card reference points and new maps 
begin with a punch of the axis points (0,P,Q). Whilst it 
is desirable to set the map on the board with the X-axis 
reasonably nearly horizontal, any departure from hori- 
zontal will be corrected for by a transformation in the 
program. The last point punched for each contour 
must be near the first as possible, i.e. all contours must be 
closed. 

The order of punching is given below. The trace 
reading program operates on the principle that all the 
required information must be given at least once prior 
to any contour tracing and, once given, any item remains in 
force until over-written by a re-punch of that item. Faulty 
punching may be corrected by re-punching from the last 
menu point including that menu point. The 'cancel' 
menu point cancels the last line of data and may be used 
up to six times in sequence to cancel the last six lines 
except that the menu reference points may not themselves 
be cancelled this way and must be correctly punched 
(otherwise the run is aborted). 



Recommended order of punching 

1. Punch menu reference point '1'. 

2. Punch menu reference point '2' 

3. Punch point '0' 

4. Punch point '?' 

5. Punch point 'Q' 
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6. Punch menu point 'NGR' 

7. Keyboard punch 'XXXYYY cr If 

XXXYYY are numeric co-ords of origin in 

ten-kilometre units in National Grid system. 
160 is added to Y-eo-ord for Northern Ireland 
and the Channel Islands. 



8. Punch menu point 'SCALE' 

9. Keyboard punch 'DDDDD' 

Numeric distance OP <= 00.) in kilometres, left- 
justified one to five digits 

10. Punch menu point 'IDENT' 

11. Keyboard punch 'DDDDD' 

Numeric identification number for following 
contour or contours, left-justified, one to five 
digits. 

12. Punch menu point 'BEGIN CONTOUR' 

13. Punch contour points 

14. Punch menu point either 'END + CONTOUR' 

or 'END -CONTOUR' 

15. Return to (12) for more contours 

or (10) for new identification number 

16. End of punching 

or 

Punch menu point 'NEW AXES' and return to 
(3) for new map or axes 

or 

Punch menu point 'NEW MENU' and return to 
(1) at end of a reel of paper tape. 



punching, 
were: 



The most likely errors found in practice 



(1> Missing out the 'end contour' menu point. 
(2} Failure to trace a contour back to its start 
point. 

(3) Incorrect punching due to the trace digitiser. 

(4) Faulty extraction of the co-ords of the map 
origin. 

Of the above errors (2) is easily detectable by a 
programmed check and (3) is almost always detectable 
since the program was written to include a character- 
by-character check of each line of data. (1) and (4) are 
checked by comparing the output listing from a run against 
the original map and overlay. Should errors be detected 
at this stage a re-run is necessary with the corrected data. 



5. Population count for the UHF Television service 

The requirement was to produce a complete count for 
the u.h.f. television network showing gross, net and cumula- 
tive coverage for each station in order of date of opening 
up to the end of 1972. In all 92 stations were counted 
from 450 hand-drawn map overlays on 1 inch Ordance 
Survey maps. Each overlay had on average 10 contours 
together referring to up to 5 different stations. It was 
also required to produce a data base of service area infor- 
mation which could be updated as new stations were 
opened. The operation is described by Fig, 5. The 
information was batched into convenient trace digitiser 
runs and each new operation for each batch of data 
gathered new items for that batch. Each batch was kept 
in a separate envelope to which was attached a 'progress 
sheet' which was initialled by the operator or engineer 
for each completed process. Thus it was possible to handle 
and process many batches at a time without fear of repe- 
tition or loss of data. 

The initial phase resulted in the preparation of 92 
CONTOUR files, one for each station occupying 170 
Blocks of FASTRAND storage. The final checks indicated 
a few errors. These were corrected by re-tracing, adding the 
new data "to the CONTOUR files and deleting entries 
from the directory where there were erroneous contours. 



4.3 Data validation 

The above trace digitising procedures were designed 
both to allow operators maximum freedom in correcting 
errors that they notice at the time the errors are made and 
to make the detection of un-noticed errors as near fool- 
proof as possible. Apart from allowing double checks 
in the program, the 'menu card' approach is valuable in 
reminding the operator to include all the necessary data 
in the right order. All the operator has to do is move the 
cursor up the menu card and, at each new point, read the 
name of that point and then follow with appropriate 



It remained to carry out two runs: MAINA was run 
to form the coverage data base and MAINB was run to carry 
out the complete population count. These runs took a 
total of 8 minutes 1108 CPU time, indicating that the 
design of the programs for efficient running had been 
reasonably successful. 
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Fig. 5- System used to count UHF Service Population Coverage 
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Appendix: UCC Facilities 



The main frame computers are Univac 1 108 operated in automated remote batch mode servicing a variety of Remote 
Job Entry terminals. A subsidiary time-sharing system, FASBAC, services dial-up teletype terminals and has file storage and 
editing facilities and acts as a terminal to submit 1 108 runs and receive output. 

The 1 1 08 specification is, in outline, as follows : 

52K x 36 bit word core store available to user 

760K word drum backing store - random access -4 ms average access time 

22M word FASTRAND permanent file stores - random access - 90 ms average time 

8 x magnetic tape servo. 
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